Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation

نویسندگان

Yamato Ohtani

Tomoki Toda

Hiroshi Saruwatari

Kiyohiro Shikano

چکیده

The performance of voice conversion has been considerably improved through statistical modeling of spectral sequences. However, the converted speech still contains traces of artificial sounds. To alleviate this, it is necessary to statistically model a source sequence as well as a spectral sequence. In this paper, we introduce STRAIGHT mixed excitation to a framework of the voice conversion based on a Gaussian Mixture Model (GMM) on joint probability density of source and target features. We convert both spectral and source feature sequences based on Maximum Likelihood Estimation (MLE). Objective and subjective evaluation results demonstrate that the proposed source conversion produces strong improvements in both the converted speech quality and the conversion accuracy for speaker individuality.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Context-based Statistical Models to Promote the Quality of Voice Conversion Systems

This article aims to examine methods of optimizing GMM-based voice conversion systems performance in which GMM method is introduced as the basic method for improvement of voice conversion systems performance. In the current methods, due to using a single conversion function to convert all speech units and subsequent spectral smoothing arising from statistical averaging, we will observe quality ...

متن کامل

Straight-based voice conversion algorithm based on Gaussian mixture model

The voice conversion algorithm based on the Gaussian mixture model (GMM) has also been proposed by Stylianou et al. In this algorithm, the acoustic space of a speaker is represented continuously. In this paper, we apply this GMMbased voice conversion algorithm to STRAIGHT proposed by Kawahara et al., which is recognized as a high quality vocoder. In order to evaluate this voice conversion algor...

متن کامل

An improved one-to-many eigenvoice conversion system

We have previously developed a one-to-many eigenvoice conversion (EVC) system enabling the conversion from a specific source speaker’s voice into an arbitrary target speaker’s voice. In this system, eigenvoice Gaussian mixture model (EV-GMM) is trained in advance with multiple parallel data sets composed of utterance pairs of the source and many pre-stored target speakers. The EV-GMM is effecti...

متن کامل

Evaluation of cross-language voice conversion based on GMM and straight

Voice conversion is a technique for producing utterances using any target speakers’ voice from a single source speaker’s utterance. In this paper, we apply cross-language voice conversion between Japanese and English to a system based on a Gaussian Mixture Model (GMM) method and STRAIGHT, a high quality vocoder. To investigate the effects of this conversion system across different languages, we...

متن کامل

A GMM-STRAIGHT Approach to Voice Conversion

This paper explores the topic of voice conversion as explored in a joint project with Percy Liang (EECS, Berkeley). For our purposes, voice conversion is the process of modifying the speech signal of one speaker (source) such that it sounds as though it had been pronounced by a different speaker (target). Following the Source-Filter model of speech production, we begin by assuming that most of ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2006

Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation

نویسندگان

چکیده

منابع مشابه

Using Context-based Statistical Models to Promote the Quality of Voice Conversion Systems

Straight-based voice conversion algorithm based on Gaussian mixture model

An improved one-to-many eigenvoice conversion system

Evaluation of cross-language voice conversion based on GMM and straight

A GMM-STRAIGHT Approach to Voice Conversion

عنوان ژورنال:

اشتراک گذاری